Search CORE

2 research outputs found

Pessimistic Off-Policy Multi-Objective Optimization

Author: Alizadeh Shima
Bhargava Aniruddha
Gopalswamy Karthick
Jain Lalit
Kveton Branislav
Liu Ge
Publication venue
Publication date: 28/10/2023
Field of study

Multi-objective optimization is a type of decision making problems where multiple conflicting objectives are optimized. We study offline optimization of multi-objective policies from data collected by an existing policy. We propose a pessimistic estimator for the multi-objective policy values that can be easily plugged into existing formulas for hypervolume computation and optimized. The estimator is based on inverse propensity scores (IPS), and improves upon a naive IPS estimator in both theory and experiments. Our analysis is general, and applies beyond our IPS estimators and methods for optimizing them. The pessimistic estimator can be optimized by policy gradients and performs well in all of our experiments

arXiv.org e-Print Archive

A data-driven iterative refinement approach for estimating clearing functions from simulation models of production systems

Author: Atherton Linda F.
Buzacott John A.
Conover William Jay.
Hannah Lauren A.
Hopp Wallace J.
Howard Ronald A.
Karmarkar Uday S.
Karthick Gopalswamy
Pochet Yves
Reha Uzsoy
Vollmann T. E.
Publication venue: 'Informa UK Limited'
Publication date
Field of study

Crossref